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The height and weight are the fundamental indices of 
human growth, which are strongly affected by various 
factors such as genetic effects, race, nutrition, congenital 
disease, and so on. The relationship between the indices 
and physical conditions has been extensively studied to 
establish a simple measure, such as the body mass index 
(BMl).^"'^) One of the serious health problems is obe- 
sity which causes various diseases such as cardiovascular 
disease and diabetes. Thus, investigating the statistical 
properties of height and weight from the viewpoint of the 
human growth process is important for the prevention of 
adult diseases. 

For the statistical property of height, it has been com- 
monly believed that the height distribution obeys the 
normal distribution,^' 
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where is the frequency divided by the total num- 
ber of individuals, x is the height, and and cr are the 
average and the standard deviation, respectively. If the 
height distribution were approximated by the normal dis- 
tribution, it would suggest that the growth of individuals 
is governed by the stochastic additive process because 
the sum of independent random variables approximately 
obeys the normal distribution according to the central 
limit theorem. 

On the other hand, the multiplicative process is often 
introduced to explain the growth process of living organ- 
isms. In the multiplicative process, the growth of living 
organisms is governed by the time evolution. 



Xt+i = atXt 

= at ■ at-i ■ ■ - ao ■ Xq, 
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where Xt and at are the size and the growth rate at time 
t, respectively. If at is independent of the size Xt, this 
process is called Gibrat's process, which reproduces the 
lognormal distribution of Xt as 

fi^t) = — -^^ exp 
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after long-time growth.^' Here, f{x), a, and T are the 
frequency divided by the total number of individuals, 
the standard deviation, and the median, respectively. In 
natural and social phenomena, the lognormal distribu- 
tion can be observed in various cases such as population 



distribution of villages in Japan, fragmentation of glass 
rods,®-' and so on. 

In this short note, we investigate the height distribu- 
tion of Japanese schoolchildren by data analysis. Our 
data analysis is based on the data of the school health 
survey by the ministry of education, culture, sports, sci- 
ence and technology, Japan (MEXT).9) This survey is 
performed yearly for the schoolchildren ranging from 5 
to 17 in age who belong to the schools randomly sam- 
pled from all the schools in Japan. The survey date 
ranges from April 1 to June 30 in each year. In 2006, 
the numbers of sampled schools and children were 7755 
and 695600, respectively. The data obtained from MEXT 
are histogram data classified into some classes of height. 
In the case of 17- year-old female students in 2006, the 
range of the data, from 130cm to 184cm, are splitted 
into 55 equal-sized (1cm) classes. Note that the interpre- 
tation of the data are subject to sampling errors because 
of the sample survey. 

The procedure of our analysis is as follows. First, for 
the data of each survey year, we fit the height distribu- 
tion in each age by the lognormal and the normal dis- 
tributions, respectively, by the use of GNUFIT routine 
implemented within GNUPLOT, which is based on the 
Marquardt-Levenberg algorithm for fitting. Second, to 
determine which distribution can be better approxima- 
tion, we calculate the sum of squared residuals from the 
data classified into m classes as 



El?. 



(5) 



where Ol and El are the observed and the estimated 
values for the j-th height class at each age z, respec- 
tively. Third, from the sums of squared residuals calcu- 
lated from both the lognormal and the normal distribu- 
tions, we calculate the ratio 
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where r\^^^'^ and R\" are the sum of residuals cal- 
culated from the lognormal and the normal distribu- 
tions, respectively. The height distribution is well ap- 
proximated by the normal distribution when loge; is 
larger than 0, while it is well approximated by the lognor- 
mal distribution when log is less than 0. We calculate 
€i for each age i to obtain the relationship between ti 
and age i in a given survey year. 

Figure 1(a) shows the relationship between age i and 
log Ei , where log is common logarithm, log = logj^g ■ Here 
we plot the results obtained from the data of both male 
and female students in 2006. We find the clear transi- 
tion from the lognormal distribution to the normal dis- 
tribution in both male and female students around 11 
or 13 years old. The timing of the transition is earlier 
in female students rather than that in male students. 
We commonly call this period puberty in which physical 
changes occur in a child's body to become an adult body 
capable of reproduction. For the adult, the height distri- 
bution seems to be approximated by both the lognormal 
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process of individuals obeys the multiplicative process 
before puberty. When an individual reaches the age 
around puberty, the growth process becomes rather addi- 
tive, which might cause the transition of the height distri- 
bution to the normal distribution. Modeling the growth 
process to reproduce such a transition is our near future 
task. Besides the modeling, we need to analyze the data 
of other countries to investigate whether similar tendency 
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Fig. 2. Relationship between age and loge for the cohort born 
in 1990. Sohd and chain lines are for male and female students, 
respectively. 



Fig. 1. Relationship between age and loge in (a) 2006 and (b) 
1970. Solid and chain lines are for male and female students, 
respectively. 

and the normal distributions. Limpert et al. pointed out 
that the similar tendency can be seen in the height dis- 
tribution of female adults. 

In the case of another year, Fig. 1(b) shows the result 
in 1970, where we also find clear transition from the log- 
normal distribution to the normal distribution although 
the timing of the transition is slightly different from that 
in 2006. These results would imply that the transition 
from the lognormal distribution to the normal distribu- 
tion in height distribution of schoolchildren occurs in any 
period in Japan. 

Next, we investigate the time evolution of the height 
distribution of the cohort born in 1990. We analyze the 
data ranging from 1995 to 2007. Notice that all the mem- 
bers belonging to the cohort are totally different because 
the sampled schools and individuals are different accord- 
ing to the survey year. Figure 2 shows the relationship 
between log e and age of the male and female schoolchil- 
dren who belong to the cohort. Also in this case, we can 
observe the similar transition from the lognormal distri- 
bution to the normal distribution. After the transition 
occurs, around 16 years old, the data can fit both distri- 
butions. 

Prom these analyses, we conjecture that the growth 



can be universally observed among schoolchildren. In ad- 
dition, the growth process of height is closely related to 
that of weight. We will report our analysis of the weight 
distribution in another occasion. 
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